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ABSTRACT 
The  effects  of  parametric  mis specification  on  Leontief  Input- 
Output  models  are  discussed,  and  a  method  of  estimating  uncertainty 
"bounds  on  the  solution  and  the  impact  of  identifying  and  updating 
the  most  important  small  suhset  of  model  parameters  is  presented. 
To  facilitate  projection  of  future  values  of  technological  coeffi- 
cients, a  method  for  redefining  sectors  to  reflect  real  technological 
alternatives  is  shown.   It  complements  the  sensitivity  analysis 
techniques  presented  by  further  reducing  the  nvaabev   of  parameters  to 
be  updated,  while  increasing  the  certainty  with  which  they  can  be 
forecasted. 


1.   INTRODUCTION 

Since  their  introduction  over  thirty  years  ago,  most  input-output 
(l-O)  models  have  typically  been  based  on  data  necessarily  aggregated 
to  less  than  150  sectors  because  of  computational  limitations.   At 
this  level,  the  models'  parameters  (technological  coefficients)  have 
been  shown  to  be  sufficiently  constant  over  time  to  permit  utili- 
zation  as  a  forecasting  tool. 

Recently,  I-O  models  have  become  available  at  the  360  to  ^80- 
sector  levels  of  detail.   It  has  not  been  possible  to  determine  whether 
these  coefficients  are  constant  over  time,  due  to  a  lack  of  sufficient 
data.    There  is  a  possibility  that  at  this  level  of  detail  they 
may  be  less  stable  and  possibly  less  predictable. '   The  usefulness 
of  these  detailed  1-0  models  for  such  purposes  as  technology  assessment 
and  output  forecasting  is  therefore  questionable. 

In  this  paper  we  shall  demonstrate  a  method  for  restructuring  1-0 
models  so  the  parameters  more  accurately  reflect  real  technological 


*  The  necessity  of  the  fixed  parameter  assumption  results  also  from 
the  typical  seven  year  lag  between  measurement  and  publication  of 
U.  S.  input-output  tables.   Stability  of  technological  coefficients 
has  been  studied  by  Carter  [7]  and  [l^.   For  a  discussion  of  estimation 
techniques,  see  Almon  [l]. 

**  Data  are  available  for  only  two  years ,  19^3  and  19^7  in  current 
collars  from  the  U.  S.  Department  of  Commerce  [lO],  and  price  deflators 
have  not  yet  been  published  at  the  levels  of  aggregation  necessary  to 
compare  them. 

t  For  example,  consider  technological  substitutions  such  as  aluminum 
for  copper  in  electric  transmission  cables,  shifts  among  transportation 
modes,  etc. 


options,  a  framework  in  which  parametric  forecasting  may  be  done  with 
greater  certainty.   Then  the  effect  of  uncertainty  on  model  outputs 
will  be  evaluated,  and  a  systematic  procedure  for  minimizing  it  will  be 
presented. 

Methods  for  quantifying  the  effects  of  possible  variations  of 
parameters  have  been  presented  by  Christ   L8 J  and  Berman    [2J.   We 
shall  employ  a  method  derived  by  Sebald  [12]  which  efficiently 
generalizes  Herman's  work,  and  extends  Christ's  results  to  obtain 
tighter  error  boTonds  for  a  broader  class  of  I-O  models.   In  particular, 
we  shall  investigate  several  methods  for  identifying  a  small  subset 
of  the  parameters  which  most  significantly  affect  the  results.   Then 
these  paraineters  will  be  updated  and  the  effects  on  the  uncertainty  of 
model  outputs  will  be  evaluated  in  light  of  three  suggested  applications 
of  the  method:   first,  to  establish  priorities  for  updating  only  key 
technological  coefficients  rather  than  entire  1-0  tables;''"  second,  to 


*  For  a  limited  class  of  1-0  models  and  a  given  uncertainty  level  on 
the  technical  coefficients  A,  Christ  identified  the  most  troublesome 
values  of  the  perturbed  A  matrix,  and  gave  an  upper  bound  on  the 
element  by  element  difference  between  the  nominal  and  perturbed 
inverses. 

**  E.  Berman  of  the  U.  S.  Bureau  of  Mines  proposed  a  method,  involving 
the  Sherman-Morrison  equation,  for  identifying  those  parameters  in  A 
whose  variation  coiold  cause  a  significant  increase  in  the  output 
required  by  any  of  the  economy's  industrial  output  sectors. 

t  Plans  by  the  U.  S.  Department  of  Commerce  to  annually  update  1-0 
tables  have  been  repeatedly  delayed  due  to  budget  constraints,  but 
some  updates  may  be  available  within  the  year  at  an  aggregated  (87- 
sector)  level  of  detail.   Utilizing  the  methods  presented  here  it  may  be 
possible  to  use  these  limited  resources  to  update  the  detailed  tables, 
or  to  update  the  aggregated  ones  at  a  lower  cost. 
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identify  parameters  which  should  perhaps  be  estimated  more  accurately 
when  the  tables  are  initially  constructed;  and  third,  to  identify 
critical  points  in  the  system  where  small  technical  changes  could  most 
significantly  affect  the  output  requirements  from  specified  (e.g. 
energy)  sectors. 
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2.   MODEL  STRUCTURE 

p 
The  N  technical  coefficients  in  the  model  are  not  necessarily 

in  one  to  one  correspondence  vith  real  technological  options.   A 
single  technical  change  (e.g.  fuel  substitution  for  space  heating) 
may  affect  many  parameters  in  several  rows  and  columns  of  the  matrix. 
It  is  only  because  there  are  no  explicit  markets  in  certain  productive 
services  (e.g.  space  heat),  that  I-O  data  do  not  yield  production 
functions  that  explicitly  reflect  these  options.   To  simplify  the 
task  of  parameter  estimation  and  reduce  the  number  of  parameters 
affected  by  such  technical  changes,  we  present  a  method  for  restruc- 
turing the  model. 

We  begin  by  adding  a  set  of  ficticious  'energy  service'  sectors. 
They  are  defined  as  essentially  nonsubsti tut able  productive  service 
inputs  (e.g.  space  heat,  air  conditioning,  industrial  process  heat) 
which  could  be  provided  by  a  variety  of  combinations  of  fuel  inputs. 
We  represent  each  sector's  requirements  for  these  'energy  services' 
in  the  conventional  fixed  coefficient  framework,  so  that  these  fuel 
substitution  options  need  not  be  included  in  every  sector's  production 
function.   Thus  the  technical  coefficients  most  likely  to  change  as  a 
result  of  energy  price  changes  and  policies  are  localized  in  the 
small  partition  of  the  technical  coefficients  matrix  defined  by  the 


*     Since  energy  cooperates  primarily  with  capital  to  provide  productive 
services  (see  E.  Berndt  and  D.  Wood  [3]),  and  since  changes  in  capital 
requirements  (if  any)  associated  with  the  fuel  substitution  would  not 
affect  the  current  accoiint  coefficients,  we  assume  such  changes  in  A 
are  not  accompanied  by  substantial  changes  in  other  parameters  outside 
that  submatrix.   This  satisfies  the  conditions  set  forth  by  Leontief 
[lOb]  for  the  establishment  of  such  ficticious  sectors. 
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intersection  of  the  energy  supply  sector  rows  with  the  supply  and 
service  sector  columns.    This  subset  of  technical  coefficients  is 
essentially  a  set  of  'design  parameters'  for  a  nation's  energy  system. 
Models  are  available  to  solve  for  optimal  future  values  of  these 
■  parameters  "by  minimizing  fuel,"  capital  and  other  costs  of  producing 
a  specified  output  of  energy  services. 

The  energy  rows  of  the  technical  coefficients  matrix  are  con- 
structed from  physical  data."^  Unlike  models  where  all  sector 
outputs  are  expressed  in  xinits  of  current  dollars,  the  condition 
that  all  column  sums  are  less  than  or  equal  to  one  (with  at  least  one 
strict  inequality)  is  not  satisfied  here.   Since  the  existence  of 
a  feasible  Leontief  inverse  is  no  longer  guaranteed,  one  must 
explicitly  verify  that  the  spectral  radius  of  A  is  less  than  unity  [ll]. 
This  condition  is  satisfied  for  our  expanded  model.   Finally,  the 
lack  of  the  column  sum  condition  eliminates  the  possibility  of  utilizing 
special  sensitivity  analysis  techniques  such  as  that  developed  by 
Christ.   We  describe  a  more  generally  applicable  technique  in  the 
next  section. 


*   This  reduces  the  number  of  technical  coefficients  affected  by  such 
technological  changes  by  more  than  a  factor  of  ten. 

**  For  example,  the  Brookhaven  Energy  System  Optimization  Model 
(see  Hoffman  [9])  solves  the  first  of  the  firm's  two  step  cost 
minimization  problem,  yielding  an  implicit  price  for  energy  services. 
Only  if  these  prices  change  substantially  will  it  be  necessary  to 
modify  the  fixed  coefficients  in  the  energy  service  rows. 

t   This  method,  following  Leontief  [lOa]  accounts  for  the  fact  that 
energy  is  sold  to  various  sectors  at  vastly  different  prices.   As 
tables  are  disaggregated,  the  use  of  physical  data  becomes  possible  in 
many  rows,  since  the  product  mix  becomes  more  homogeneous  for  each 
sector.   Data  sources  are  discussed  and  construction  of  the  domestic 
base  I-O  tables  is  described  by  Bullard  and  Herendeen  [5]  and  by  the 
authors  [6]. 
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3.   UNCERTAINTY  AND  IMPORTANT  PARAMETERS 

Since  I-O  tables,  typically  published  every  five  years,  are 
generated  from  six  or  seven  year  old  data,  some  account  must  be  taken 
of  parametric  uncertainty  resulting  both  from  measurement  errors  and 
from  technological  change  over  time.   The  longer  the  planning  horizon 
for  the  predictive  applications  of  the  model,  the  greater  the  uncertain- 
ty. 

Consider  a  general  problem  whose  solution  S  is  of  the  form 

S  =  S  [(I-A)-^,  Y].  (1) 

Given  a  matrix  AA  such  that  A  +.  AA  represents  the  maximum  upper  (lower) 

boTonds  on  the  uncertainty  of  the  model's  parameters,  we  can  evaluate 

the  positive  and  negative  tolerances  on  the  solution.   Restricting 

our  attention  to  the  positive  case  for  notational  simplicity,  the 

tolerance  is  given  by 

^=   S[(I-A-AA)"^,  Y]  ^2) 

^   S[(l-A)-\  Y] 

This  same  relation  is  useful  for  quantifying  the  extent  to  which  output 

tolerances  could  be  reduced  by  more  accurately  forecasting  a  subset 

2 
of  the  parameters,  in  cases  where  N   is  large. 

We  shall  first  discuss  methods  for  estimating  maximum  error  bounds 
on  the  results  given  uncertainties  in  the  elements  of  the  matrix  A 
of  technological  coefficients.   Then  we  describe  a  technique  for 
identifying  a  subset  of  A  where  technical  changes  or  parametric  uncer- 
tainty would  have  the  greatest  impact  on  an  arbitrarily  specified 
importance  function. 
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3.1  Maximum  Error  Bounds 

Since  stochastic  error  analyses  are  unwieldy  for  problems  of  this 
type  and  scale,  we  restrict  our  attention  to  estimates  of  maximum 
error  bounds. 

A  simple  approach  utilizing  matrix  norm  analysis  boionds  the 
uncertainty  on  the  inverse  of  a  matrix  due  to  an  uncertainty  in  its 
elements  as  follows 


6C 


C  -^  -  (C+6C)"^| 


M 


(3) 


c-i||  11^^! 


1-M 


provided 


C  ^1  I  •  I  |6C|  I  <  1 


Where  M  =  the  condition  number  of  C  =  | |c| |  •  | |c   |  j .   Equation  (3) 
says  in  effect  that  given  a  percentage  perturbation  (in  the  no2rm  sense) 
on  C,  the  resulting  percentage  perturbation  on  C   will  be  less  than  or 
equal  to  M  times  as  large,  provided  that  |  |c   |  |  •  |  |6C|  |  is  sm.all.   As 
demonstrated  in  section  ^4,  this  technique  gives  a  veiy  loose  upper 
bound.   An  example  will  suffice  here  to  show  how  the  triangle  inequality 
on  which  (3)  is  based  could  produce  overly  conservative  results. 


*  Note  that  each  step  in  a  Monte  Carlo  analysis  woiild  require  inversion 
of  an  extremely  large  matrix.   In  addition,  ■unverifiable  assumptions 
regarding  the  statistics  of  the  W^  parameters  must  also  be  made. 
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Let  A  = 


P  °1 

[O      200j 


B  = 


I |ab) I  ^  I |a| I   •   I |b| I 

20       <      200      •        1 

a  very  conservative  bound. 

A  much  tighter  bound  on  the  inverse  uncertainty  may  be  obtained 
using  the  following  procedure,  which  involves  creation  of  two  perturbed 
A  matrices,  one  which  is  the  perturbation  causing  the  greatest  possible 
increase  (i.e.,  positive  tolerance)  on  all  elements  of  (I-A)    and  the 
other  which  causes  the  greatest  negative  tolerance  on  (I-A)   .   The 
inverses  of  these  two  matrices  give  the  worst  case  (positive  and 
negative)  element-by-element  tolerances  on  (I-A)   .   It  can  be  proved 
that  for  a  wide  spectrum  of  I-O  models,  these  perturbed  A  matrices 
are  easily  created  by  permitting  all  elements  of  A  to  assume  their 
maximum  (minimuin)  values  simultaneously.    This  gives  a  much  tighter 
worst  case  bound  than  the  norm  bound,  but  it  must  be  remembered  that 
tightness  of  the  bo\md  will  depend  on  the  likelihood  that  all  elements 
might  be  in  error  in  the  same  direction  simultaneously. 


*   See  Sebald  (op.  cit.  ) 


**     Since  row  and  column  sums  of  1-0  tables  (gross  outputs)  are 
usually  known  with  greater  certainty  than  interindustry  transactions, 
the  worst  case  tolerance  is  highly  unlikely. 
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3. 2  Identification  of  Important  Parameters 

2 

Since  N   is  large,  one  should  concentrate  on  updating  those 

parameters  which  contribute  most  to  the  uncertainty  of  the  solution. 
To  quantify  the  reduction  in  output  uncertainty,  we  shall  set 
AA. .  =  0  for  a  subset  of  most  "important  parameters  and  reevaluate 
eq.  (2).   To  identify  these  parameters  one  must  first  specify  the 
model  outputs  with  respect  to  which  importance  is  to  be  defined. 
Thus  we  define  an  importance  function  of  the  form 

J  =  f[(l-A-AA)"-^,  Y]  "  (i|) 

where  J  may  be  a  scalar,  vector,  or  matrix  expression  of  the  problem 
solution,  and  is  of  order  less  than  or  equal  to  that  of  (I-A)~  . 
After  specifying  an  uncertainty  level  on  elements  of  A,  one 
can  evaluate  the  resulting  AJ  using  the  Sherman  and  Morrison  relation. 
A  parameter  A. .  is  said  to  be  important  if  its  uncertainty  causes  some 
element  of  AJ  to  exceed  a  prescribed  threshold.    Its  importance  with 
respect  to  the  entire  J  is  given  by: 


EEx        where    X   =(  (5) 

mn  van 

0  otherwise 
T  is  the  importance  threshold  and  v  >_  1, 


*   See  J.  Sherman  and  W.  Morrison  [13] 


**  Note  that  a  brute  force  application  of  the  Sherman-Morrison 
relation  will  not  suffice  for  large  I-O  models.   If  m,  n,  i,  j 
e{l,2. . .,370},  1.9  x'IQIO  tests  would  be  required.   An  efficient 
method  for  reducing  the  number  of  these  tests  needed  to  identify  all 
i,  j,  m,  n  for  which  v  _>  1  is  presented  by  Sebald  (op.  cit.)  where 
this  'important  parameter'  problem  is  discussed  in  detail. 
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k .   RESULTS 

Sample  calculations  are  presented  for  four  classes  of  importance 
functions  (scalar,  vector,  submatrix,  or  the  entire  Leontief  inverse). 
For  each  case,  we  show  the  effect  of  'updating'  the  most  important  2% 
of  the  model's  parameters  on  the  output's  uncertainty  level. 

For  expository  simplicity,  the  results  we  present  here  are  for 
a  101  sector  model,  but  the  methods  are  applicable  and  efficient  at  any 
level  of  disaggregation, 
i+.l  Total  Electricity  Demand 

For  predicting  total  electricity  demand,  we  set  the  importance 
function,  J,  equal  to: 

J  =  u- (l-A)~^  Y  =  u-X 

i  =  electric  utilities    (6) 

,0    elsewhere 

where  u  is  a  row  vector  which  extracts  and  s\ims  the  total  requirements 
for  outputs  from  the  electric  utility  sectors.   We  assume  that  the 
nominal  values  of  the  technical  coefficients  over  the  'prediction' 
period  are  given  by  their  base  year  values.   The  tolerance  interval  on 
J  resulting  from  j^  10^  uncertainty  on  all  parameters  is  [-23.^^,  30. i+^]. 
This  tells  us  much  more  than  the  condition  number  criterion,  which 
states  only  that  our  10^  input  errors  coxild  be  magnified  up  to  ^9 
times. 


*  Sectors  correspond  to  the  standard  82-sector  Commerce  Department 
model  except  that  transportation  and  energy  sectors  are  fully 
disaggregated  and  eight  'energy  service'  sectors  are  added. 
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To  further  tighten  the  tolerance  interval,  we  identified  the  2% 
of  the  parameters  most  important  with  respect  to  the  scalar  function 
eq.  (6),  and  held  them  to  their  nominal  values.   This  improved  the 
tolerance  interval  to  [-U.2^,  ^.T^]. 
\.2     Demand  for  Energy  Services 

Extimates  of  future  requirements  for  energy  services  are  needed 
as  inputs  to  models  used  to  predict  the  cost-minimizing  mix  of  energy 
resources,  fuels,  and  energy  production  facilities  for  the  entire 
economic  system.    The  expanded  I-O  model  gives  these  in  eight  elements 
of  the  total  output  vector  X.   Selecting  the  importance  function,  J, 
as  the  vector  of  those  eight  elements,  we  identify  the  corresponding 
suhset  of  most  important  parameters.   Importance  in  this  case  is 
determined  "by  summing  over  the  number  of  times  a  parameter  satisfies 
the  condition  (5)  for  any  element  of  J.   Positive**  tolerances  on  the 
solution  vector  before  and  after  updating  the  most  important  2%   of  the 
parameters  are  shown  in  Table  1.   The  average  positive  tolerance 
reduction  was  from  about  l8^  to  U^. 


Before 
After 


Ore 

Reduction 

Feedstocks 

Other 
Feedstocks 

Motive 
Power 

Process 
Heat 

Water 
Heat 

Space 
Heat 

Air 
Condi- 
tioning 

Electric 
Drive  & 
Lighting 

32 

27 

8 

30 

5 

9 

12 

20 

6 

i| 

2 

T 

1 

3 

3 

6 

Table  1.   Energy  Service  Requirements:   Maximum  Upper 

Bounds  (Percent)  Before  and  After  'Correction' 
of  Most  Important  Parameters 


See  Hoffman,  (op.  cit.). 


**  Negative  tolerances  are  of  the  same  order  of  magnitude,  but  slightly 
smaller. 
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Tolerance  intervals  on  this  vector  function  vere  not  reduced 
evenly,  but  that  could  have  been  accomplished  by  appropriately 
weighting  J,  thereby  reducing  it  to  a  scalar  importance  function  of  the 
solution  vector. 
k.3     Energy  Cost  of  Goods  and  Services 

The  energy  supply  sector  rows  of  the  Leontief  inverse  matrix 
have  been  used  for  evaluating  energy  impacts  of  consiomer  choices,  and 
analyses  of  alternative  government  spending  programs  [5,^].   Selecting 
this  submatrix  as  our  importance  function,  and  examining  the  effects  of 
'updating'  the  most  important  2%   of  the  model's  parameters,  yields  the 
results  summarized  in  fig.  1.   Again  there  was  a  marked  reduction  in 
the  tolerance  interval,  from  an  average  of  hk%   before  updating  to  20^ 
after  in  the  positive  case.   Figure  1  shows  how  these  tolerances  were 
distributed  over  the  hundreds  of  elements  in  the  submatrix. 
h .k     Entire  Leontief  Inverse 

The  most  general  importance  function  would  be  the  set  of  all 

elements  comprising  the  Leontief  inverse  matrix.   The  distribution 

2' 
of  positive  tolerances  over  the  N   elements  of  this  importance  function 

is  shown  in  fig.  2.   The  mean  positive  tolerance  was  reduced  only  from 

33^  to  31^  (a  6%   reduction)  by  holding  the  most  important  2%   of  the 

parameters  to  their  nominal  values. 
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CORRECTED 
UNCORRECTED 


5-20 


:0-40 


40-60 


>60 


Figure  1.      Number  of  Energy  Intensities  Within  Various 

Percent  Tolerance   Intervals    (Effect  on  energy 
rows  of  inverse  matrix  resulting  from  correcting 
the  most   important   2%  of  the  direct  coefficients.) 


-13- 


CORRECTED 
UNCORRECTED 


5-20 


20-40 


40-60 


>60 


Figure   2.      Number  of  Inverse  Elements  Within  Various 
Percent   Tolerance   Intervals    (Effect  on  all 
elements  of  the  inverse  resulting  from 
correcting  the  most   important  2^  of  the 
direct   coefficients.) 
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5.   CONCLUSIONS 

Reducing  the  imcertainty  of  results  of  input-output  analyses 
requires  selective  updating  of  the  model's  parameters.   There  are  two 
ways  to  reduce  the  number  of  parameters  that  must  he  updated  to  achieve 
a  given  degree  of  output  certainty.   First  the  model  can  "be  structured 
to  effect  an  isomorphic  mapping  between  real  technological  options  and 
model  parameters,  thereby  reducing  the  number  of  parameters  signifi- 
cantly affecting  results  of  certain  problems.   Second,  a  small  subset 
of  parameters  having  the  greatest  impact  on  the  results  can  be 
identified  and  updated. 

For  the  examples  presented,  updating  only  the  most  important  2% 
of  the  model's  parameters  was  sufficient  to  achieve  substantial 
reductions  in  the  tolerance  intervals  for  the  scalar  and  vector 
importance  functions.   As  the  number  of  elements  in  the  importance 
fimction  increased,  the  improvement  was  less  dramatic,  since  more 
parameters  must  be  updated  to  serve  the  broader  classes  of  potential 
problems  to  be  addressed. 

The  ordering  and  relative  importance  of  parameters  thus  determined 
provides  information  for  l)  setting  priorities  for  data  acquisition 
programs  to  update  parameters,  2)  constructing  more  accurate  I-O 
tables  for  a  given  cost,  and  3)  identifying  technologies  where  small 
changes  have  maximum  impact  on  a  policy  objective,  such  as  energy 
conservation. 
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